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Abstract 

In the Flow Edge-Monitor Problem, we are given an undirected graph G = (V, E)^ an integer 
k > and some unknown circulation ip on G. We want to find a set of k edges in G, so that if 
we place k monitors on those edges to measure the fiow along them, the total number of edges 
for which the fiow can be uniquely determined is maximized. In this paper, we first show that 
the Flow Edge-Monitor Problem is NP-hard, and then we give two approximation algorithms: 
a 3- approximation algorithm with running time 0((m + n)^) and a 2- approximation algorithm 
with running time 0((m + n)^), where n = |V| and m = \E\. 

1 Introduction 

We study the Flow Edge-Monitor Problem (FlowMntrs, for short), where the objective is to find 
k edges in an undirected graph G = {V^ E) with an unknown circulation '0, so that if we place k 
flow monitors on these edges to measure the flow along them, we will maximize the total number 
of edges for which the value and direction of ^ is uniquely determined by the flow conservation 
property. Intuitively, the objective is to maximize the number of bridge edges in the subgraph 
induced by edges not covered by monitors. (For a more rigorous deflnition of the problem, see 
Section [2|) 

Consider, for example, the graph and the monitors shown in Figure [T] In this example we 
have k — 4: monitors represented by rectangles attached to edges, with measured flow values and 
directions shown inside. Thus we have '0(2, 3) = 4, '0(3, 8) = 2, '0(6, 4) = 7 and '0(1, 2) = 1. From 
the flow conservation property, we can then determine that 0(3,5) = 2, 0(8,6) = 2, 0(7,5) = 3 
and 0(5, 6) = 5. Thus with 4 monitors we can determine flow values on 8 edges. 

Our results. We flrst show that the FlowMntrs problem is NP-hard. Next, we study polynomial- 
time approximation algorithms. We introduce an algorithm called ct-Greedy that, in each step, 
places up to a monitors in such a way that the number of edges with known flow is maximized. We 
then prove that 1-Greedy is a 3-approximation algorithm and that 2-Greedy is a 2-approximation 
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Figure 1: A graph with 
4 monitors. 



algorithm. The running times of these two algorithms are 0((m + n)^) and 0((m + n)^), respec- 
tively, where n = \V\ and m = \E\. In both cases, our analysis is tight. In fact, our approximation 
results are stronger, as they apply to the weighted case, where the input graph has weights on 
edges, and the objective is to maximize the total weight of the edges with known flow. 

Related work. A closely related problem was studied by Gu and Jia [1] who considered a traffic 
flow network with directed edges. They observed that m — n+1 monitors are necessary to determine 
the flow on all edges of a strongly connected graph, and that this bound can be achieved by placing 
flow monitors on edges in the complement of a spanning tree. (The same bound applies to connected 
undirected graphs.) Khuller et al. [5j studied an optimization problem where pressure meters may 
be placed on nodes of a flow network. An edge whose both endpoints have a pressure meter will have 
the flow determined using the pressure difference, and other edges may have the flow determined 
via flow conservation property. The goal is to compute the minimum number of meters needed 
to determine the flow on every edge in the network. They showed that this problem is NP-hard 
and MAX-S NP-hard, and that a local-search based algorithm achieves 2-approximation. For planar 
graphs, they have a polynomial-time approximation scheme. The model in [5j differs from ours in 
that it assumes that the flow satisfies Kirchhoff's current and voltage laws, while we only assume 
the current law (that is, the flow preservation property). This distinction is reflected in different 
choices of "meters": vertex meters in [5] and edge monitors in our paper. Recall that, as explained 
above, minimizing the number of edge monitors needed to determine the flow on all edges is trivial, 
providing a further justiflcation for our choice of the objective function. 

The FlowMntrs problem is also related to the classical fc-cut and multi-way cut problems 
[SI El d] , where the goal is to flnd a minimum- weight set of edges that partitions the graph into 
k connected components. One way to view our monitor problem is that we want to maximize 
the number of connected components obtained from removing the monitor edges and the resulting 
bridge edges. 

2 Preliminaries 

We now give formal deflnitions. Let G = (V^ E) be an undirected graph. Throughout the paper, 
we use n — \V\ to denote the number of vertices in G and m = IE"! to be the number of edges. 
We will typically use letters u^v^x^y, possibly with indices, to denote vertices, and a^b^e^f^ ... 
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to denote edges. If an edge e has endpoints x, ^, we write e = {x, y}. We allow multiple edges and 
loops in G, so the endpoints do not uniquely define an edge: if e = {x^y} and / = {x^y}^ it is not 
necessarily true that e = /. 

A circulation on G is a function -0 that assigns a flow value and a direction to any edge in 
E. (We use the terms "circulation" and "flow" interchangeably, slightly abusing the terminology.) 
Denoting by tlj{u,v) the flow on e = {u,v} from u to we require that '0 satisfies the following 
two conditions (i) ip is anti-symmetric, that is iIj{u^v) — —iIj{v^u) for each edge {u^v}^ and (ii) ijj 
satisfies the flow conservation property, that is Yl,{uv}eE^{^'>^) ~ ^ each vertex v. 

A bridge in G is an edge whose removal increases the number of connected components of G. 
Let Br{G) be the set of bridges in G. 

Suppose that some circulation ij; is given for all edges in some set M C and not for other 
edges. We have the following observation: 

Observation 1. For {u^v} ^ E — M, ip(u^v) is uniquely determined from the flow preservation 
property if and only if {u^ v} G Br{G — M). 

We can now define the gain of M to be gain{G^M) — \M U Br{G — that is, the total 

number of edges for which the flow can be determined if we place monitors on the edges in M. We 
will refer to the edges in M as monitor edges, while the bridge edges in Br{G — M) will be called 
extra edges. If G is understood from context, we will write simply gain{M) instead of gain{G^M). 

The Flow Edge-Monitor Problem (FlowMntrs) can now be defined formally as follows: given 
a graph G = iV^E) and an integer /c > 0, find a set M C with \M\ < k that maximizes 
gain{G,M). 

The weighted case. We consider the extension of FlowMntrs to weighted graphs, where each 
edge e has a non- negative weight w{e) assigned to it, and the task is to maximize the weighted 
gain. More precisely, if M are the monitor edges, then the formula for the (weighted) gain is 
gain{M) = Y^^^mub ^i^)^ ^ = 5r(G - M). We will denote this problem by WFlowMntrs. 

Throughout the paper, we denote by M* some arbitrary, but fixed, optimal monitor edge set. 
Let = Br{G — M*) be the set of extra edges corresponding to M*. Then the optimal gain is 
gain%G,k) = i(;(M*US*). 

SimpUfying assumptions. We make some assumptions about the input graph G that will sim- 
plify the proofs. First, if A; > m, then we can simply take M — E and this will be an optimal 
solution to WFlowMntrs. Therefore, without loss of generality, throughout the paper we will 
assume that m > k. 

The flow value on any bridge of G must be 0, so we can assume that G does not have any bridges. 
Further, if G is not connected, we can do this: take any two vertices v from different connected 
components and contract them into one vertex. This operation does not affect the solution to 
WFlowMntrs. By repeating it enough many times, we can transform G into a connected graph. 
Summarizing, we conclude that, without loss of generality, we can assume that G is connected and 
does not have any bridges. In other words, G is 2-edge-connected. (Recall that, for an integer c > 1, 
a graph H is called c-edge-connected, if H is connected and it remains connected after removing 
any set of at most c — 1 edges from H.) 
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Next, we claim that can in fact restrict our attention to 3-edge-connected graphs. To justify 
it, we show that any weighted 2-edge-connected graph G — iV^E) can be converted in hnear time 
into a 3-edge-connected weighted graph G' — {V ^ E') such that: 

(i) gairf(G,k) = gairf{G\k), and 

(ii) If C E"^ is a set of k monitor edges in G\ then in hnear time one can find a set M C of 

k monitor edges in G with gainiG^ M) — gain(G\ M'). 

We now show the construction of G^ A 2-cut is a pair of edges {e, e^} whose removal disconnects 
G. Write e ^ if {e, e^} is a 2-cut. It is known, and quite easy to show, that relation "2^" is an 
equivalence relation on E. The equivalence classes of are called edge groups. 

Suppose that G has an edge group F with |F| = for q > 2, and let Hq be the connected 

components of G — F. Then F = {ei, e^}, where, for each z, = {ui^ Vi}, ui G Hi and vi E 
(for i — q we assume g + 1 = 1). For z = — 1, contract edge ei so that vertices ui and vi 

become one vertex, and then assign to edge Cq — {uq^ Vq] weight Y^\^i w{ei). We will refer to Cq as 
the deputy edge for F. Figure [2] illustrates the construction. 




Let G^ = (V\ E') be the resulting weighted graph. By the construction, G' is 3-edge-connected. 
All edge groups can be computed in linear time (see, [7J, for example), so the whole transformation 
can be done in linear time as well. 

It remains to show that G' satisfies conditions (i) and (ii). If M is any monitor set, and if M 
has two or more monitors in the same edge group, we can remove one of these monitors without 
decreasing the gain of M. Further, for any monitor edge e of M, we can replace e by the deputy 
edge of the edge group containing e, without changing the gain. This implies that, without loss 
of generality, we can assume that the optimal monitor set M* in G consists only of deputy edges. 
These edges remain in G' and the gain of M* in G' will be exactly the same as its gain in G. This 
shows the ">" inequality in (i). The "<" inequality follows from the fact that any monitor set in 
G' consists only of deputy edges from G. The same argument implies (ii) as well. 

Summarizing, we have shown in this section that, without loss of generality, we can assume 
that the input graph G has m > k edges and is 3-edge-connected. 
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The kernel graph. Consider an input graph G — {V^ E) and a monitor edge set M, and let 
B — Br(G — M). The kernel graph associated with G and M is defined as the weighted graph 
Gm — {Vmi Em), where Vm is the set of connected components ofG — M — B, and Em is determined 
as follows: For any edge e G MUS, where e = {u^ v}, let and be the connected components of 
G — M — B that contain, respectively, u and v. Then we add edge {u'^ v'} to Em- The weights are 
preserved, that is w{{u\v^}) = w{{u,v}). We will say that this edge {u\v^} represents {u,v} or 
corresponds to v}. In fact, we will often identify {u^ v} with v^}^ treating them as the same 
object. We point out that in general Gm is a multigraph, as it may have multiple edges and loops 
(even when G does not). However, since G is 3-edge-connected, it is easy to see that so is Gm- 

Figure [3] shows the kernel graph corresponding to the graph and the monitor set in the example 
from Figure [1] (all edge weights are 1): 




Note that we have \Em\ < + \ Vm \ — 1- This can be derived directly from the definitions: The 
edges in Gm that represent extra edges are the bridges in Gm and therefore they form a forest in 
Gm- This (and the fact that Gm is connected) implies that the number of extra edges is at most 
\Vm\ — 1, and the inequality follows. 

In the paper, we will use the concept of kernel graphs only with respect to some optimal monitor 
set. Let M* be some arbitrary, but fixed, optimal monitor edge set. To simplify notation, we will 
write G* = (y*,£^*) for the kernel graph associated with M*, that is G* = Gm*, V^* = Vm* and 
£■* = Em*- In this notation, we have gain''{G,k) = w{E''). In the analysis of our algorithms, we 
will be comparing the weights of edges collected by the algorithm against the edges in the kernel 
graph G*. 

3 Proof of NP-hardness of FlowMntrs 

We show that the FlowMntrs is NP-hard (even in the unweighted case), via a reduction from 
the Clique problem. We start with a simple lemma. 

Lemma 2. Let ai, a2, . . . , he s positive integers such that Yld=i — /^^ ^ fixed integer n. 
Then Yld=i (2') maximized if and only if aj — n — s + 1 for some j and ai — 1 for all 1^3. 

Above, we assume that (2) = 1(1 — 0)/2 = 0. 

Proof. By routine algebra, one can verify that for 2 < a < 6 we have 
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Without loss of generality, assume ai = max^ a^. If > 1, for any z > 1, by the inequality above, 
we can change ai + 1 and ai^ai — 1, increasing the value of Yld=i (2')- repeating this 
argument, we obtain that an optimum is achieved for ai = n — 5 + 1 and a2 — a-^ — ... — as — 1. 
That all optima have this form (up to a permutation of the a^'s) follows from the fact that inequality 
^ is strict. □ 



Theorem 3. Flow^Mntrs is UP -hard. 

Proof. In the Clique problem, given an undirected graph G — {V^ E) and an integer g > 0, we 
wish to determine if G has a clique of size at least q. Clique is well-known to be NP-complete 
(see [2j). We show how to reduce Clique, in polynomial-time, to DecFlowMntrs, the decision 
version of FlowMntrs, defined as follows: Given a graph G — {V^E) and two integers, k,l > 0, 
is there a set M of k edges in G for which \Br{G — M)\ > 11 

The reduction is simple. Suppose we have an instance G = {y^E)^q of Clique. Without loss of 
generality, we can assume that G is connected and g > 3. Let n = |y| and m —\E\. We map this 
instance into an instance G, fc, / of DecFlowMntrs, where A: = m — (^) — / and I ^ n — q. This 
clearly takes polynomial time. Thus, to complete the proof, it is sufficient to prove the following 
claim: 

(*) G has a clique of size g iff G has a set M of A: edges for which \Br{G — M)\ > I. 

We now prove (*). The main idea is that, by the choice of parameters k and Z, the monitors and 
extra edges in the solution of the instance of DecFlowMntrs must be exactly the edges outside 
the size-q clique of G. 

(^) Suppose that G has a clique C of size q. Let G^ be the graph obtained by contracting C 
into a single vertex and let T be a spanning tree of G^ We then take M to be the set of edges of 
G' outside T. Thus the edges in T will be the bridges of G — M. Since G' has n — g + 1 vertices, 
T has I — n — q edges, and M has m — — I — k edges. 

(<^) Suppose there is a set M oi k monitor edges that yields a set B of V extra edges, where 
I <l' < n — 1. We show that G has a clique of size q. 

Let s be the number of connected components of G — M — 5, and denote by ai, a2, the 
cardinalities of these components (numbers of vertices). Since \B\ — I' ^ we have s > I' + 1. Also, 
Yld=i ^ ^ ^iid J2i=i (2') k -\- r > m. Therefore, using Lemma [2j and the choice of k and /, we 
have 




By routine calculus, the function f{x) = 2 ~ ^)(^ ~ ^ ~ 1) + ^ is decreasing in interval [0, n — 1], 
and therefore the above derivation implies that < /, so we can conclude that V — I. This, in turn. 
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implies that all inequalities in this derivation are in fact equalities. Since ([2]) is an equality, we have 
5 — l = = / = n — Then, since ([s]) is an equality. Lemma [2] implies that aj = q for some j 
and ai — 1 for all i ^ j. Finally, Q can be an equality only if all the connected components are 
cliques. In particular, we obtain that the jth component is a clique of size q. □ 

4 Algorithm cr- Greedy 

Fix some integer constant cr > 1. Let G — {V^E) be the input graph with n — \V\^ m — \E\^ 
and with weights on edges. As justified in Section [2} we will assume that m > k and that G is 
3-edge-connected. 

Algorithm ct-Greedy that we study in this section works in [fc/a] steps and returns a set of k 
monitor edges. In each step, it assigns a monitors to a set P of a edges that maximizes the gain 
in this step, that is, the total weight of the monitor edges in P and the bridges in G — P. These 
edges are then removed from G, and the process is repeated. A more rigorous description is given 
in Figure [4| which also deals with special cases when the number of monitors or edges left in the 
graph is less than a. 

Algorithm a- Greedy 

Mo^0 

for t^l,2,..., \k/a'] 
if Et-i = 

then return M — Mt-i and halt 

if t = [k/a\ +1 

then a' — k mod a 
if \Et-i\ < a' 

then P ^ Et-i 

else 

find P C Et-i with |P| = a' 

that maximizes w{PU Br{Gt-i — P)) 
Yt^PUBr{Gt-i-P) 
Xt^Xt_iUYt 
Et^Et-i - Yt 
Gt^{V,Et) 
Mt ^ Mt-i U P 
return M — Mp^^/^r] 

Figure 4: Pseudo-code for Algorithm ct-Greedy. Yt represents the edges collected by the algorithm 
in step t, with P CYt being the set of monitor edges and Yt — P the set of extra edges. Mt represents 
all monitor edges collected up to step t and Xt represents all edges collected up to step t. 

Note that each step of the algorithm runs in time 0(m^(n + m)), by trying all possible combi- 
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nations of a edges in the remaining graph Gt-i to find P. Hence, for each fixed a, Algorithm a- 
Greedy runs in polynomial time. 

4.1 Analysis of 1-Greedy 

In this section we consider the case a — 1. Algorithm 1- Greedy at each step chooses an edge whose 
removal maximizes the gain and places a monitor on this edge. This edge and its corresponding 
bridges are removed from the graph. This process is repeated k times. We show that this algorithm 
has approximation ratio 3. 

Analysis. Fix the value of A:, and some optimal solution M* of k monitor edges, and let G* = 
(y*, £■*) be the corresponding kernel graph. To avoid clutter, we will identify each edge in £"* with 
its corresponding edge in thus thinking of £"* as a subset of E. For example, when we say that 
the algorithm collected some e G we mean that it collected the edge in E represented by e. 

Recah that gairf{G, k) = w{E''), where w{E'') is the sum of weights of the edges in Thus 
we need to show that 1-Greedy's gain is at least \w{E''). 

Let e^, z = 1, 2, A:, be the k heaviest edges in £"*, ordered by weight, that is w{ei) > w{e2) > 
... > w(ek). First, we claim that for each t = 0, 1, A:, we have 

y^m > T!i=Mei). (5) 

where Xt denotes the set of edges collected by the algorithm in the first t steps. 

The proof of ([5]) is by a straightforward induction. It is vacuously true for t = 0. Suppose 
^ holds for = t - 1; we will then show that it also holds for t. If Xt contains all edges 
ei,...,e^, then Q holds. Otherwise, choose any j, 1 < j < t, for which Cj ^ Xt. By induction, 
w{Xt-i) > '^(^O? and ej is available to 1-Greedy in step t, so its gain in step t is at least 

w{ej). Therefore w{Xt) > w{Xt-i) + w{ej) > w{Xt-i) +w{et) > Yll=i completing the proof 

of (§. 

Since G is 3-edge-connected, each vertex in G* has degree at least 3, so > which 
implies that k > - + 1 > Thus, from g, we obtain that the erain of 1-Greedy is 

"^i^k) > 'l2i=i^{^i) ^ |'^(^*)- Summarizing, we obtain: 

Theorem 4. Algorithm 1-Greedy is a polynomial-time 3- approximation algorithm for the Weighted 
Flow Edge-Monitor Problem, WFlowMntrs. 

With a somewhat more careful analysis, one can show that the approximation ratio of 1-Greedy 
is actually 3(1 — l/fc), which matches our lower bound example below. 

A tight-bound example. We now present an example showing that our analysis of 1-Greedy 
is tight. Graph G consists of one connected component with 2k — 2 vertices, in which each vertex 
has degree 3 and each edge has weight 1, and the other connected component that has only two 
vertices connected by A: + 2 edges each of weight 1 + e. Fig. [5] shows the construction for A: = 5. 

1-Greedy will be collecting edges from the 2-vertex component on the left, ending up with k 
edges and total gain (1 + e)A:. The optimum solution is to put k monitors in the cubic component 
on the right, thus gaining all 3A: — 3 edges from this component. For 6^0, the approximation 
ratio tends to 3(1 — 1/A:). 
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Figure 5: Lower bound example 
for 1-Greedy, with = 5. 



4.2 Analysis of 2- Greedy 

We now consider a — 2. At each step, Algorithm 2-Greedy selects two edges that maximize 
the gain. Ties are broken arbitrarily. We place monitors on these two edges, and then remove 
them from G, as well as the resulting bridges. The process continues for \_\k\ steps. Exceptional 
situations, when k is odd, or we run out of edges, etc., are handled as in Figure [4j In this section 
we show that 2-Greedy is a 2-approximation algorithm. 

Idea of the analysis. Let G = (y, E) be the input graph with weights on edges. As before, we 
will assume that m > k and that G is 3-edge-connected. We can further assume that 2-Greedy 
never runs out of edges, for otherwise it computes an optimal solution. We fix some optimal solution 
M*, and let G* = (y*, be the corresponding kernel graph with u = vertices and /i = 
edges. Recall that gairf{G^k) = w{E'^); thus we need to show that 2-Greedy's gain is at least 

Our proof for 1-Greedy was based on the observation that k > thus we only needed 

to show that 1-Greedy's gain is at least the total weight of the k heaviest edges in E'*. This 
is not sufficient for 2-Greedy. To see this, consider a simple situation where G* is unweighted 
with all vertices of degree 3, the extra edges form a tree in G*, and k is even. Then = |z/ and 
fi^u-l-k — l^sofi^ 3k. Therefore we need to show that in this case 2-Greedy collects at least |A: 
edges. For the unweighted graph, this is not hard to show: pick a set J of ^fc independent vertices 
in G*. For each vertex v in J, at each step of 2-Greedy, either the three edges of v have already 
been collected, or they can be collected in this step by placing monitors on two of them. (It is also 
possible that one edge out of v has already been collected, but in this case the remaining two can 
be collected in this step as well.) Therefore the gain of 2-Greedy in ^k steps will be at least the 
number of edges out of J, that is 

If G is weighted, the argument above is not sufficient, since the edges incident to vertices in 
J may have small weights. Further, it may happen that 2-Greedy will collect only k edges in 
total, if they are heavy enough, and in this case we need to argue that their total weight is at least 
^ even though k may be only about 

The proof consists of two parts. First, we introduce the concept of a bundle. Intuitively, a 
bundle is a set of edges that can be collected with at most two monitors, although our definition 
is more restrictive and will be given shortly. We show that G* contains a set T of at most l^k\ 
disjoint bundles with total weight at least ^w{E'^). In the second part of the proof we show that 
the gain of 2-Greedy is at least the total weight of T. 
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Analysis. For any vertex in G* of degree 3, the set of three edges incident on this vertex is cahed 
a tripod. A set of edges is cahed a bundle if is either a tripod or it consists of at most two edges. 
Clearly all edges of a bundle can be collected with at most 2 monitors. If T is a set of bundles, by 
Et we denote the set of edges in T, that is Et = U^gt ^^^^ extend the definition of bundles 

later in the proof of Lemma |6j) 

Lemma 5. For k >2, there exists a set T of at most \_k/2\ disjoint bundles such that w{Et) > 

He*). 

Proof. First, we construct a collection Z of bundles that contains all tripods in G* and such that 
each edge appears in exactly two bundles in Z. To this end, we create two copies of each edge. If 
e = {x, y} is an edge, then by and we will denote these two copies of e. Let F be the set of 
all these edge copies. 

For each vertex v of G* of degree 3, if e, /, g are the edges incident to add the tripod 
f3 = {e^, f^,g^} to Z and remove e^, and from F. Let F^ be the set of remaining edges in F. 

We now want to partition F^ into pairs, with each pair becoming a bundle. Assume first that 
\F^\ is even; the argument for the general case is essentially the same but slightly more cumbersome 
because of the parity issue, and it will be explained later. 

Group the edges in F^ arbitrarily into pairs. As long as any of these pairs has two copies of the 
same edge, say {e^,e^}, do this: take any other pair {f^^g^} (where possibly f — g) and replace 
these two pairs by pairs {e^,/^} and {e^,^^}. Eventually each pair will contain different edges. 
All these pairs become bundles that are now added to Z. This completes the construction of Z. 

To construct T, we now greedily extract from Z non-overlapping bundles with maximum weight. 
More specifically, we start with T = 0, and repeat the following step as long as \Z\ > 4: choose a 
bundle (3 in Z with maximum w{/3), add it to T, and then remove from Z exactly four bundles: 
all other bundles in Z that intersect /?, and possible a few more additional bundles so that the total 
is four. This is possible, because, by the definition of Z, each bundle in Z intersects at most three 
other bundles in Z. If, at the end, \Z\ ^ 0, then we add to T the bundle /3 in Z with maximum 
w{f3) and remove all remaining bundles (at most three) from Z. 

We now have our set T of bundles, and it remains to show that it has the desired properties. 
Obviously, all bundles in T are disjoint. In each step of the construction of T, when we add a 
bundle f3 to T, we reduce w{Ez) by at most 4it;(/3), so w{Et) > \w{Ez) = ^it;(£'*), as needed. 

It remains to show that \T\ < [^k\. Let be the number of vertices of degree d in G*. All 
vertices in G* have degree at least 3, thus 

2^ = J2d>3d-^d > 3z/3 + 4(z/ - z/3) = 4Z/-Z/3. 

We also have /j. < z/ + A: — 1, which, together with the inequality above yields 2/x — z/3 < 4A: — 4. In Z, 
we have exactly tripods and ^(2/i — 3z^3) — iJi — |z^3 pairs, so we obtain \Z\—ii— ^z/3 < 2A; — 2. 
Hence |T| < \{2k - 2)/4] = \{k - l)/2] < [\k\, as needed. 

Now we deal with the case when is odd. We execute the same process for pairing edges, but 
in this case we will end up with one unpaired edge that will become a bundle by itself. The proof 
that w{Et) > ^w(E^) remains valid. We still need to show that |T| < [^fcj. In Z, we have us 
tripods, ^(2/x — 3z/3 — 1) pairs, and one singleton; hence \Z\ = z/3 + ^(2/x — 3z/3 — 1) + 1 = |(2/i — Z/3 + I). 
As before, we have 2/x — z^3 < 4A: — 4, but now 2ji — v-^ — {2ji — 3^/3) + 2z^3 = |^^| + 2z^3 is odd, 
which implies that we in fact have 2/x — z/3 + 1 < 4A: — 4. This implies \Z\ <2k — 2, and the bound 
|r| < [k/2\ follows, as before. □ 
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Lemma 6. Let k > 2, and let T he the set of bundles constructed in Lemma^ Then w{X^j./2\) ^ 
w(Et); thus 2-Greedy^5 gain is at least w(Et). 

Proof. Let ^ = [k/2\. To prove the lemma, we show that there is a partition of Et into disjoint 
sets 5i, ^2, Bi (some possibly empty) such that w{Bt) < w{Yt)^ for t = 1, 2, where Yf is the 
set of edges collected by 2-Greedy in step t. This is clearly sufficient to establish the theorem. 

The idea of the proof is to proceed one step at a time, for t = 1, 2, starting with T' ^ T 
and at each step reducing the number of bundles in T' . The invariant will be that at each step t, 
all bundles in T' will be available to 2-Greedy for collection. At step t, we eliminate from these 
bundles all edges collected by the algorithm, and rearrange some bundles, if necessary, in such a 
way that the number of bundles in T' strictly decreases. 

To implement the above idea we need to extend the definition of bundles. If /? = {e, /, g} is a 
tripod in G* and 2- Greedy collects e at some step t' <t while f^g remain uncollected until step 
t, then the pair [3' = {f^g} is called a hipod at step t. If C £;* is a set of edges not collected 
in the first t — 1 steps, then [5 is called a bundle at step t if either (i) is a tripod in G*, or (ii) 
(3 = /?! U /32, where each is either an edge or a bipod or 0. (We will omit phrase "at time t" 
whenever t is understood from context.) Clearly, this definition extends the one given earlier in 
this section. However, it is still true that 2-Greedy can collect all edges from a bundle in a single 
step (that is, with two monitors). 

Now we describe the construction of the sets Bt. Initially, T' = T. Suppose that, for some 
t > 1, we have already constructed Si, ...,Bt-i and modified accordingly. Consider now step t 
of 2- Greedy. We distinguish three cases. 

Case 1 : Yt intersects at most one bundle in T^ If Yf intersects one bundle, let be this bundle; 
otherwise, let be any bundle. Remove (3 from T and set Bt = (3. 

Case 2 : Yf contains a bundle in T^ In this case we simply set Bf = Yf H E^' and we update T' 
as follows: remove from T' all bundles that are contained in Yt^ and for each other bundle [5 in T' 
that intersects y^, remove from jS the edges in Yt (that is, [3^ (3 — Yt). 

Case 3 : Yt intersects at least two bundles in T' but it does not contain any. We let Bt ^Yt(^ Et'. 
To update T', we proceed in two steps. First, choose any two bundles /3i,/32 in T' intersected by 
It, and replace them by f^iU [^2 — Yt. Then, for each other bundle in intersected by 1^, remove 
from all edges in Yt. 

Note that Cases 2 and 3 are not mutually exclusive. If, at some steps, both of these cases apply, 
any of them can be chosen arbitrarily. 

We claim that this procedure is correct, in the sense that at each step t all elements of 
are bundles. To this end, we make two observations. If = {e,/, ^} is a tripod at time t, then 
2-Greedy can either collect one edge from (3 or all three, but it cannot collect just two. If one edge 
is collected, the remaining edges form a bipod. Also, if {e, /} is a bipod at time t, then 2-Greedy 
either collects both edges e, / or none. If we remove any edges from a bundle, it obviously remains 
a bundle. Another type of update occurs in Case 3 where we replace f3i and /32 by f3i U [^2 — Yt. 
Here, by the case condition, each f3i — Yt is either an edge or a bipod, and thus /3i U /32 — Yt is a 
correct bundle. 

Next, we estimate the weight of the sets Bt. In Cases 2 and 3 we have Bt ^ Yt, so clearly 
^{Yt) > w{Bt). Due to the fact that it takes no more than two monitors to collect all edges in a 
bundle and 2-Greedy chooses Yt, w{Yt) is at least as large as the weight of any bundle in at 
time t; hence w{Yt) > w{Bt) holds for Case 1 as well. 
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Finally, note that we have no more than I bundles in T' to start with and the total number of 
bundles in T' strictly decreases in each step. Therefore after I steps we will have = 0, and thus 
Bi, B2, Bi is a partition of Et, completing the proof. □ 

In the case for = 1, 2-Greedy actually gets an optimal solution, and Lemma[5]and[6]combined 
imply that for /c > 2 the gain of 2-Greedy is at least half of the optimum. Thus, summarizing, 
we obtain our main result. 

Theorem 7. Algorithm 2-Greedy is a polynomial-time 2- approximation algorithm for the Weighted 
Flow Edge-Monitor Problem, WFlowMntrs. 

A tight-bound example. Our analysis of 2- Greedy is tight. The example is essentially the 
same as the one for 1-Greedy, illustrated in Figure [5) except that the edges in the 2-vertex 
component on the left side have now weights 1.5 + e. 2-Greedy will be collecting edges from this 
2-vertex component, so its total gain will be (1.5 + e)A:, while the optimum gain is 3A: — 3. For e ^ 
and A: ^ oc, the ratio tends to 2. 

5 Final Comments 

The most intriguing open question is what is the approximation ratio of a- Greedy in the limit for 
a ^ 00. We can show that this limit is not lower than 1.5, and we conjecture that 1.5 is indeed 
the correct answer. 

A natural question to ask is whether our results can be extended to directed graphs. It is not 
difficult to show that this is indeed true; both the NP-hardness proof and 2-approximation can be 
adapted to that case. 

Another intriguing direction to pursue would be to study the extension of our problem to 
arbitrary linear systems of equations. In that context, we can put k "monitors" on k variables of 
the system to measure their values. The objective is to maximize the number of variables whose 
values can be uniquely deduced from the monitored variables. 
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